Topological Properties of the Minimal Spanning Tree in the Korean and American Stock Markets

We investigate a factor that can affect the number of links of a specific stock in the network between
stocks created by the minimal spanning tree (MST) method, using data on individual stocks listed
on the S&P500 and KOSPI. Among the common factors mentioned in the arbitrage pricing model
(APM), which is widely acknowledged in finance, we take a representative market index as the
possible factor. We find that the distribution of the correlations, $\rho_{ij}$, among 400 stocks
taken from the S&P500 index is very similar to that of the Korean stock market, and that both
deviate from the correlation distribution of surrogate time series from which nonlinearity has been
removed. We also show that the degree distribution of the MSTs for both stock markets follows a
power law with exponent $\zeta \approx 2.1$, while the degree distribution for the surrogate time
series follows an exponential distribution with exponent $\delta \approx 0.77$. Furthermore, the
correlation, $\rho_{iM}$, between an individual stock $i$ with degree $k$ and the market index $M$
follows a power law, $\langle \rho_{iM}(k) \rangle \sim k^{\gamma}$, with exponents
$\gamma_{\mathrm{S\&P500}} \approx 0.16$ and $\gamma_{\mathrm{KOSPI}} \approx 0.14$, respectively.
Thus, regardless of the market, individual stocks closely related to the common factor in the
market, the market index, are likely to be located near the center of the network between stocks,
while those weakly related to the market index are likely to be placed on its periphery.

PACS numbers: 89.75.Fb, 89.65.Gh, 89.75.Hc
I. INTRODUCTION



Recently, researchers from diverse disciplines have shown great
interest in the topological properties of networks. In particular,
networks observed in the natural and social sciences show a variety
of properties that differ from those of random graphs. The economic
world, known for having the most complex structure among them,
evolves through the nonlinear interaction of diverse heterogeneous
agents. The stock market is a representative example. The stock
prices of individual companies are formed by a complex evolution
process of the diverse information generated in the market, and they
are strongly correlated with each other through common factors in
the market [1]. In other words, individual stocks are connected with
each other, and companies with the same properties tend to be
grouped into a community. To investigate these properties, Mantegna
et al. proposed the minimal spanning tree (MST) method, which can be
used to observe the grouping process of individual stocks traded in
the market on the basis of the correlations between stocks. Mantegna
constructed the stock network visually using the MST method and
found that the resulting network between stocks exhibits an
economically meaningful grouping.

Studies over the past several years have shown that the degree
distribution of the network created by the MST method follows a
power law with exponent $\zeta \approx 2$ [1]. That is, most
individual companies in the stock market have a small number of
links with other stocks, while a few stocks have a great number of
connections. However, the network of KOSPI200 companies in the
Korean stock market, one of the emerging markets, does not follow a
power-law distribution. Moreover, for the American stock market
(S&P500), market capitalization and the influence strength (IS),
$|q|$, are positively correlated, whereas for the KOSPI200 they show
no correlation [10]. Previous results also showed that the network
of individual stocks tends to cluster around companies of a
homogeneous industrial group [11]. Furthermore, the stocks forming a
Markowitz efficient portfolio are mostly located on the periphery of
the network [15]. However, until recently, there have been few
studies on the factors that determine the number of links a stock
has with other stocks in the network.

To investigate the topological properties of the stock market
network, we used the MST method introduced by Mantegna et al. We
also consider the market index as a possible factor that can affect
the number of links of a specific stock with other stocks in the
network created by the MST method. We used data for 400 individual
companies listed on the S&P500 index from January 1993 to May 2005,
and for 468 individual companies listed on the KOSPI from January
1991 to May 2003.

We found that the distribution of the correlations, $\rho_{ij}$,
among the 400 stocks in the S&P500 index is very similar to that of
the KOSPI, and that for both stock markets these distributions
deviate from the correlation distribution of surrogate time series,
from which nonlinearity has been removed by the surrogate method
introduced by J. Theiler et al. We also found that the degree
distribution of the network, in line with previous research, follows
a power law with exponent
$\zeta_{\mathrm{S\&P500,\,KOSPI}} \approx 2.1$ for both the Korean
and American stock markets. To identify a possible factor
determining the degree $k$ in the MST network, we calculate the
cross-correlation, $\rho_{iM}$, between an individual stock and the
market index for both stock markets. We found that the
cross-correlation, $\rho_{iM}$, between the market index and
companies with degree $k$ follows a power law,
$\langle \rho_{iM}(k) \rangle \sim k^{\gamma}$, where the exponents
are $\gamma_{\mathrm{S\&P500}} \approx 0.16$ and
$\gamma_{\mathrm{KOSPI}} \approx 0.14$. In other words, individual
stocks having many connections with other stocks in the network
obtained by the MST method are more highly correlated with the
market index than those having a comparatively small number of
links.

In the next section, we describe the financial data used in this
paper. In Section 3, we introduce the methodology. In Section 4, we
present the results of this investigation. Finally, we end with a
summary.



II. DATA

We used daily data for 400 individual stocks listed on the S&P500
index of the American stock market from January 1993 to May 2005
(from the Yahoo website) and for 468 individual stocks listed on the
KOSPI of the Korean stock market from January 1991 to May 2003 (from
the Korean Stock Exchange). To investigate the possible factors
determining the number of links of an individual stock in the
network, we used the S&P500 and KOSPI indices over the same periods
as the individual stocks, respectively. As in previous studies, we
calculated the returns from the stock price, $P_t$, by the
log-difference, $r_t = \ln P_t - \ln P_{t-1}$, and used the returns
normalized by their standard deviation, $\sigma(r_t)$, defined as
follows:

$$R_t = \frac{\ln P_t - \ln P_{t-1}}{\sigma(r_t)} \tag{1}$$

where $\sigma(r_t)$ is the standard deviation of the return.



III. METHODOLOGY

We construct the networks from the stocks listed on the S&P500 and
KOSPI, respectively, using the MST method proposed by Mantegna et
al. Since the MST method builds the network from the correlations
between stocks, the cross-correlation of stocks listed on the S&P500
and KOSPI stock markets, respectively, is calculated as follows:

$$\rho_{ij} = \frac{\langle R_i R_j \rangle - \langle R_i \rangle \langle R_j \rangle}{\sqrt{\left(\langle R_i^2 \rangle - \langle R_i \rangle^2\right)\left(\langle R_j^2 \rangle - \langle R_j \rangle^2\right)}} \tag{2}$$

where $\langle \cdot \rangle$ denotes the mean over the whole period,
and the correlation lies within the range
$-1 \le \rho_{ij} \le +1$. If $\rho_{ij} = 1$, the two time series
are completely correlated, and if $\rho_{ij} = -1$, they are
completely anti-correlated. If $\rho_{ij} = 0$, the two time series
are uncorrelated. On the basis of $\rho_{ij}$ calculated by Eq. (2),
the distance between nodes is calculated as follows:

$$d_{ij} = \sqrt{2\,(1 - \rho_{ij})} \tag{3}$$

To examine the relation between the number of connections of an
individual stock with other stocks in the network and the market
index, we investigated the correlation, $\rho_{iM}$, between the
market index and individual stocks. In the same way as in Eq. (2),
we calculated the correlation, $\rho_{iM}$, between the returns of
individual stocks, $R_i$, and the return of the market index, $R_M$,
defined as follows:

$$\rho_{iM} = \frac{\langle R_i R_M \rangle - \langle R_i \rangle \langle R_M \rangle}{\sqrt{\left(\langle R_i^2 \rangle - \langle R_i \rangle^2\right)\left(\langle R_M^2 \rangle - \langle R_M \rangle^2\right)}} \tag{4}$$
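
As a minimal sketch of the steps described in the data and methodology sections, the following Python code computes normalized log-returns, the cross-correlation between return series, a correlation-based distance, and the degrees of the resulting MST. It assumes the distance $d_{ij} = \sqrt{2(1 - \rho_{ij})}$ commonly used with Mantegna's MST method and builds the tree with Prim's algorithm. The function names and the toy prices are illustrative assumptions and are not taken from the original study.

```python
import math

def normalized_returns(prices):
    """Log-returns r_t = ln P_t - ln P_{t-1}, divided by their standard deviation."""
    r = [math.log(b) - math.log(a) for a, b in zip(prices, prices[1:])]
    mean = sum(r) / len(r)
    sigma = math.sqrt(sum((x - mean) ** 2 for x in r) / len(r))
    return [x / sigma for x in r]

def correlation(x, y):
    """(<xy> - <x><y>) / sqrt((<x^2> - <x>^2)(<y^2> - <y>^2)) over the whole period."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    mxy = sum(a * b for a, b in zip(x, y)) / n
    vx = sum(a * a for a in x) / n - mx * mx
    vy = sum(b * b for b in y) / n - my * my
    return (mxy - mx * my) / math.sqrt(vx * vy)

def distance_matrix(returns):
    """Assumed distance d_ij = sqrt(2 (1 - rho_ij)); small d means high correlation."""
    n = len(returns)
    d = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            rho = correlation(returns[i], returns[j])
            # max() guards against tiny negative values caused by rounding
            d[i][j] = d[j][i] = math.sqrt(max(0.0, 2.0 * (1.0 - rho)))
    return d

def mst_edges(d):
    """Prim's algorithm on a dense distance matrix; returns the N-1 tree edges."""
    n = len(d)
    in_tree = [False] * n
    best = [math.inf] * n   # cheapest known distance from the tree to each node
    parent = [-1] * n
    best[0] = 0.0           # start the tree at node 0
    edges = []
    for _ in range(n):
        u = min((k for k in range(n) if not in_tree[k]), key=lambda k: best[k])
        in_tree[u] = True
        if parent[u] >= 0:
            edges.append((parent[u], u))
        for v in range(n):
            if not in_tree[v] and d[u][v] < best[v]:
                best[v], parent[v] = d[u][v], u
    return edges

def degrees(edges, n):
    """Number of MST links attached to each node."""
    deg = [0] * n
    for i, j in edges:
        deg[i] += 1
        deg[j] += 1
    return deg

# Toy price series (hypothetical values, for illustration only).
prices = [
    [100, 102, 101, 105, 107, 106, 110],
    [50, 51, 50.5, 52.6, 53.5, 53.1, 55.2],
    [20, 19.5, 20.4, 19.8, 20.9, 20.1, 21.0],
    [80, 81, 80, 83, 85, 84, 88],
]
returns = [normalized_returns(p) for p in prices]
dist = distance_matrix(returns)
edges = mst_edges(dist)
deg = degrees(edges, len(prices))
```

For N stocks the tree has N - 1 edges, so the degrees always sum to 2(N - 1). The dense Prim loop costs O(N^2), which is adequate for a few hundred stocks.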



IV. RESULTS 

In this section, using the MST method we investigated 
the network properties of 400 individual stocks listed on 
the S&P500 index from January 1993 to May 2005, and 
468 individual stocks listed on the KOSPI from January 
1991 to May 2003, respectively. 

First, we analyze the distribution of the elements of the
correlation matrix, $\rho_{ij}$, for individual stocks in the S&P500
index and KOSPI. Fig. 1 shows the distribution of the correlation
matrix for both stock markets. The red circles, blue squares, green
diamonds, and black triangles indicate the KOSPI, S&P500, KOSPI
(surrogate), and S&P500 (surrogate) data, respectively. We find that
the correlation distribution between stocks in the S&P500 index is
very similar to that between stocks in the KOSPI, and that both
deviate from the correlation distribution of the time series created
by the surrogate method.

FIG. 1: The PDF of the cross-correlation $\rho_{ij}$ between the
stocks listed on the S&P500 and KOSPI stock markets. The red
(circle), blue (square), green (diamond), and black (triangle)
symbols denote the KOSPI, S&P500, KOSPI (surrogate), and S&P500
(surrogate) stock markets, respectively.

To estimate the topological structure of both stock markets, we
employ the MST method proposed by Mantegna et al. Using the
whole-period data of the Korean and American stock markets, we
present the network structure calculated by the MST method and its
degree distribution for both stock markets. Fig. 2 shows the network
structures between stocks generated by the MST method from the
individual stock data listed on the American and Korean stock
markets, respectively, together with the degree distribution,
$P(k)$, of the MST networks for both stock markets. In Fig. 2, (a)
and (b) display the MST structure built from the daily returns of
the individual companies listed on the S&P500 index and KOSPI,
respectively, and (c) and (d) show the degree distribution of the
MST structure for both stock markets in log-log and linear-log
plots. The red (circle), blue (square), green (diamond), and black
(triangle) symbols indicate the KOSPI, S&P500, KOSPI (surrogate),
and S&P500 (surrogate), respectively, where the label (surrogate)
denotes the corresponding surrogate data.
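
The text does not state which variant of the surrogate method was used to build the surrogate data. One variant described by Theiler et al. is the Fourier-transform (phase-randomized) surrogate, which keeps the amplitude spectrum of a series, and hence its linear autocorrelation, while randomizing the Fourier phases. The sketch below illustrates that variant only, under the assumption that it is representative. It uses a naive O(n^2) discrete Fourier transform to stay dependency-free (an FFT routine would be used in practice), and the function names and sample values are hypothetical.

```python
import cmath
import math
import random

def dft(x):
    """Naive discrete Fourier transform of a real or complex sequence."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
            for k in range(n)]

def idft(X):
    """Naive inverse discrete Fourier transform."""
    n = len(X)
    return [sum(X[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)) / n
            for t in range(n)]

def phase_randomized_surrogate(x, rng):
    """Keep |X_k| of the input, draw new phases, and return a real series."""
    n = len(x)
    X = dft(x)
    Y = list(X)  # Y[0] (the sum of the series) is kept, so the mean is preserved
    for k in range(1, (n - 1) // 2 + 1):
        phi = rng.uniform(0.0, 2.0 * math.pi)
        Y[k] = abs(X[k]) * cmath.exp(1j * phi)
        Y[n - k] = Y[k].conjugate()  # Hermitian symmetry keeps the output real
    # For even n, the Nyquist term Y[n // 2] is left unchanged.
    return [v.real for v in idft(Y)]

# Hypothetical sample series, for illustration only.
series = [0.5, -1.2, 0.3, 2.0, -0.7, 0.1, 1.4, -0.9]
surrogate = phase_randomized_surrogate(series, random.Random(0))
```

Because only the phases change, the surrogate has the same amplitude spectrum and mean as the original series, while any structure that depends on the phases is destroyed.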

FIG. 2: (a) MST structure composed of the daily returns of 400
individual companies listed on the S&P500 index from 1993 to 2005,
and (b) MST structure composed of the daily returns of 468
individual corporations listed on the KOSPI from 1991 to 2003. (c)
shows the degree distribution of the MST structure composed of
individual companies on the S&P500 index and KOSPI. The degree
distribution for both the S&P500 index and KOSPI follows a power law
with exponent $\zeta \approx 2.1$. The red (circle), blue (square),
green (diamond), and black (triangle) symbols denote the KOSPI,
S&P500, KOSPI (surrogate), and S&P500 (surrogate), respectively.

We find that the degree distribution for both stock markets follows
a power law with exponent $\zeta \approx 2.1$, while the degree
distribution of the MST network of the time series created by the
surrogate method follows an exponential distribution with
$\delta \approx 0.77$. Thus, as found in complex networks such as
the Internet, the WWW, and protein-protein interaction networks, the
stock networks have the scale-free property. The existence of hubs
in the financial markets therefore means that there are dominant
companies that strongly influence the other stocks. To identify a
possible factor determining the degree of an individual stock in the
MST network, we calculated the correlation, $\rho_{iM}$, between an
individual stock with degree $k$ and the stock market index for both
stock markets.
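
The fitting procedure behind the quoted exponents is not described in the text. As a rough sketch, the code below builds the empirical degree distribution $P(k)$ from a list of node degrees and estimates a power-law exponent from the slope of $\ln P(k)$ against $\ln k$, and an exponential rate from the slope of $\ln P(k)$ against $k$. It assumes the forms $P(k) \sim k^{-\zeta}$ and $P(k) \sim e^{-\delta k}$, and it uses ordinary least squares, which is a simple choice that can be biased for heavy-tailed data. All function names are hypothetical.

```python
import math
from collections import Counter

def degree_distribution(degrees):
    """Empirical P(k): the fraction of nodes having each degree k."""
    counts = Counter(degrees)
    n = len(degrees)
    return {k: c / n for k, c in sorted(counts.items())}

def linear_fit(xs, ys):
    """Ordinary least squares for y = slope * x + intercept."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, my - slope * mx

def power_law_exponent(pk):
    """Assuming P(k) ~ k^(-zeta), the log-log slope equals -zeta."""
    ks = [k for k, p in pk.items() if k > 0 and p > 0]
    slope, _ = linear_fit([math.log(k) for k in ks],
                          [math.log(pk[k]) for k in ks])
    return -slope

def exponential_rate(pk):
    """Assuming P(k) ~ exp(-delta * k), the linear-log slope equals -delta."""
    ks = [k for k, p in pk.items() if p > 0]
    slope, _ = linear_fit([float(k) for k in ks],
                          [math.log(pk[k]) for k in ks])
    return -slope

# Synthetic check with exact functional forms (not data from the study).
synthetic_power = {k: k ** -2.1 for k in range(1, 21)}
synthetic_exp = {k: math.exp(-0.77 * k) for k in range(1, 21)}
zeta = power_law_exponent(synthetic_power)
delta = exponential_rate(synthetic_exp)
```

On the synthetic inputs the two estimators recover the exponents they were built from, which only checks the arithmetic. On real degree sequences, with few nodes of high degree, the estimates depend on binning and on the fitted range.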

Fig. 3 shows the distribution of the correlation, $\rho_{iM}$,
between individual stocks and the market index for the S&P500 index
and KOSPI, respectively. The blue squares and red circles indicate
the S&P500 and KOSPI.

FIG. 3: Distribution of the correlation, $\rho_{iM}$, between
individual companies and the market index for the S&P500 index and
KOSPI, respectively.

We find that the correlation, $\rho_{iM}$, between a stock with
degree $k$ (the number of connections with other stocks) and the
market index follows, for both stock markets, a power law,
$\langle \rho_{iM}(k) \rangle \sim k^{\gamma}$, with exponents
$\gamma_{\mathrm{S\&P500}} \approx 0.16$ and
$\gamma_{\mathrm{KOSPI}} \approx 0.14$, respectively. These results
show that stocks having more links with other stocks are more highly
correlated with the market index than those with relatively fewer
connections. In other words, the stocks closely related to the
market index have a larger number of links with other stocks and are
likely to be located around the center of the network. On the other
hand, the stocks weakly related to the market index have fewer links
with other stocks and tend to be placed on the periphery of the
network. Our results suggest that a common factor such as the market
index plays an important role in determining the network structure
of financial markets.
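
The relation $\langle \rho_{iM}(k) \rangle \sim k^{\gamma}$ can be illustrated with a short sketch: group the stocks by their MST degree, average the correlation with the market index within each group, and estimate $\gamma$ from the log-log slope. The code assumes the per-stock degrees and correlations are already available, and it uses an ordinary least-squares fit, which may differ from the procedure used in the study. The function names and sample values are hypothetical.

```python
import math
from collections import defaultdict

def mean_correlation_by_degree(degrees, rho_im):
    """Average rho_iM over all stocks that share the same degree k."""
    groups = defaultdict(list)
    for k, rho in zip(degrees, rho_im):
        groups[k].append(rho)
    return {k: sum(v) / len(v) for k, v in sorted(groups.items())}

def fit_gamma(mean_rho):
    """Slope of ln <rho_iM(k)> against ln k, assuming <rho_iM(k)> ~ k^gamma."""
    pts = [(math.log(k), math.log(r)) for k, r in mean_rho.items()
           if k > 0 and r > 0]  # logarithms need positive values
    n = len(pts)
    mx = sum(x for x, _ in pts) / n
    my = sum(y for _, y in pts) / n
    sxx = sum((x - mx) ** 2 for x, _ in pts)
    sxy = sum((x - mx) * (y - my) for x, y in pts)
    return sxy / sxx

# Synthetic example built from an exact power law (not data from the study).
sample_degrees = [1, 1, 2, 2, 4, 8]
sample_rho = [0.3 * k ** 0.16 for k in sample_degrees]
gamma = fit_gamma(mean_correlation_by_degree(sample_degrees, sample_rho))
```

A positive slope means that the average correlation with the market index grows with the degree, which is the pattern reported for both markets.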



V. CONCLUSIONS 

In this paper, to examine possible factors capable of affecting the
number of links that a specific stock has with other stocks in the
network created using the MST method, we carried out research using
the market index, a representative one among the multiple common
factors mentioned in the arbitrage pricing model (APM). We used 400
individual stocks listed on the S&P500 index and 468 stocks listed
on the KOSPI.

We found that the distribution of the correlations, $\rho_{ij}$,
between stocks in the S&P500 index is very similar to that between
stocks listed on the KOSPI, and that both deviate from the
correlation distribution of surrogate time series, from which
nonlinearity has been removed by the surrogate method. We showed
that the degree distribution of the network between stocks obtained
by the MST method follows, for both stock markets, a power law with
exponent $\zeta_{\mathrm{S\&P500,\,KOSPI}} \approx 2.1$, while the
degree distribution for the surrogate time series follows an
exponential distribution with exponent
$\delta_{\mathrm{S\&P500\,(surrogate),\,KOSPI\,(surrogate)}} \approx 0.77$.
To investigate a factor determining the degree $k$ in the MST
network, we used the market index for both stock markets. We found
that the correlation, $\rho_{iM}$, between stocks with degree $k$
(the number of links) and the market index follows, for both stock
markets, a power law,
$\langle \rho_{iM}(k) \rangle \sim k^{\gamma}$, with exponents
$\gamma_{\mathrm{S\&P500}} \approx 0.16$ and
$\gamma_{\mathrm{KOSPI}} \approx 0.14$, respectively. In other
words, the stocks closely related to the market index have a larger
number of links, while the stocks weakly related to the market index
have fewer links. We conclude from these findings that the degree
$k$, the most important quantity describing the network topology, is
closely related to common factors such as the market index.